Annotating Table Headers Based on Semantic Web Resources
نویسنده
چکیده
—Tables offer an often used way to represent information for the human reader. But as long as those tables are not annotated with semantic information they are meaningless to machines. In this work a methodology is proposed to annotate the headers of table columns with semantic types by creating a ranking of possible column headers based on the column cells. In the performed experiments on 10 independent columns a mean average precision value of about 0.7 was achieved. Moreover experiments showed that on average only 15 cells have to be considered to gain good results. Therefore performance does not depend on the number of rows of the table.
منابع مشابه
Transforming Arbitrary Tables into F-Logic Frames with TARTAR
The tremendous success of the World Wide Web is countervailed by efforts needed to search and find relevant information. For tabular structures embedded in HTML documents typical keyword or link-analysis based search fails. The Semantic Web relies on annotating resources such as documents by means of ontologies and aims to overcome the bottleneck of finding relevant information. Turning the cur...
متن کاملMUSETTE: uses-based annotation for the Semantic Web
In Tim Berners-Lee’s vision of the Semantic Web, software agents must be able to retrieve knowledge relevant to the end-user’s task, despite the heterogeneity and scale issues which are inherent to the web. Annotating web resources is considered a promising way to achieve this vision. We argue in this chapter that every actual use of a resource may provide useful knowledge about that resource a...
متن کاملA Domain Independent Framework for Extracting Linked Semantic Data from Tables
Vast amounts of information is encoded in tables found in documents, on the Web, and in spreadsheets or databases. Integrating or searching over this information benefits from understanding its intended meaning and making it explicit in a semantic representation language like RDF. Most current approaches to generating Semantic Web representations from tables requires human input to create schem...
متن کاملDisambiguatingWeb Tables using Partial Data
This work addresses disambiguating Web tables annotating content cells with named entities and table columns with semantic type information. Contrary to state-of-the-art that builds features based on the entire table content, this work uses a method that starts by annotating table columns using automatically selected partial data (i.e., a sample), then using the type information to guide conten...
متن کاملTowards Disambiguating Web Tables
Web tables comprise a rich source of factual information. However, without semantic annotation of the tables’ content the information is not usable for automatic integration and search. We propose a methodology to annotate table headers with semantic type information based on the content of column’s cells. In our experiments on 50 tables we achieved an F1 value of 0.55, where the accuracy great...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014